AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Efficient quantization deployment

# Efficient quantization deployment

Mistralai Mistral Small 3.2 24B Instruct 2506 GGUF
Apache-2.0
This is the Llamacpp imatrix quantized version of the Mistral-Small-3.2-24B-Instruct-2506 model, offering various quantization types to meet different hardware requirements.
Large Language Model Supports Multiple Languages
M
bartowski
3,769
12
Mistralai Devstral Small 2505 GGUF
Apache-2.0
Quantized version of Devstral-Small-2505, supporting multilingual text generation tasks, suitable for local deployment and inference.
Large Language Model Supports Multiple Languages
M
bartowski
4,817
10
Mixtral 8x22B Instruct V0.1 GGUF
Apache-2.0
A GGUF quantized version based on the mistralai/Mixtral-8x22B-Instruct-v0.1 model, supporting multi-language text generation tasks
Large Language Model Supports Multiple Languages
M
MaziyarPanahi
1,333
33
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase